Abstract

We consider a multi-user multiple-input multiple-output (MU-MIMO) system that uses orthogonal
frequency division multiplexing (OFDM). Several receivers are developed for data detection of
MU-MIMO transmissions where two users share the same OFDM time and frequency resources. The
receivers have partial state information about the MU-MIMO transmission: each receiver knows the
MU-MIMO channel, but the modulation constellation of the co-scheduled user is unknown. We
propose a receiver that performs joint maximum likelihood (ML) modulation classification of the
co-scheduled user and data detection using the max-log-MAP approximation. It is shown that the
decision metric for the modulation classification is an accumulation over a set of tones of
Euclidean distance computations that are also used by the max-log-MAP detector for bit
log-likelihood ratio (LLR) soft decision generation. An efficient hardware implementation emerges
that exploits this commonality between the classification and detection steps and results in
sharing of the hardware resources. The link performance of the proposed receiver is compared to
that of several linear receivers through computer simulations. It is shown that the proposed
receiver offers 1.5 dB improvement in signal-to-noise ratio (SNR) over the nulling projection
receiver at 1% block error rate (BLER) for 64-QAM with turbo code rate of 1/2 in the case of zero
transmit and receiver antenna correlations. However, in the case of high antenna correlation, the
linear receiver approaches suffer significant loss relative to the optimal receiver.

Index Terms 

Constellation estimation, MIMO detection, MU-MIMO 

I. Introduction 

As the demand for wireless communication increases, there is a greater challenge for improving
network capacity. Allocating more spectrum by the aggregation of several component carriers
is a method used to increase the delivered peak data rate and capacity [1]. However, carrier
aggregation comes at a substantial capital expenditure, as cellular network operators must
first acquire new spectrum and then bear the high network deployment costs of bringing these
new networks to commercial service. An alternative to spectrum expansion is to exploit the
spatial dimension using multiple antennas at the transmitter and receiver, also referred to as
MIMO [?], which has been successfully used to increase the link throughput and network capacity
for several wireless communications standards. Amongst the many transmission modes of MIMO [?],
multi-user MIMO (MU-MIMO) was proposed as a method for increasing the capacity of a wireless
network [?]. In the LTE standard as an example, which uses MIMO and orthogonal
frequency-division multiplexing (OFDM), multiple users on the downlink or uplink can be
scheduled for transmission on the same physical resource blocks (PRBs) [?]. The users are paired
together such that their cascaded precoder-channel vectors are as orthogonal as possible.
Therefore, the capacity is doubled by having the two users share the same time and frequency
resource elements. In the downlink, the base station sends pilots over the resource blocks where
the users are co-scheduled, and these pilots are used to estimate the full MU-MIMO channel. For
example, in transmission modes 7, 8, and 9 of the LTE standard, the base station transmits
pilots with orthogonal cover codes that enable the estimation of the desired and co-scheduled
users’ channels. The two users’ signals do not appear as orthogonal waveforms to the receiver
for two reasons: (1) the finite set of available precoders specified by the standard (chosen to
limit the feedback information that must be transmitted in frequency division duplex systems)
results in non-orthogonality between the users, and (2) during the potential delay between when
users are co-scheduled and when the MU-MIMO transmission occurs, the channel state may have
changed so that the orthogonality between the users is lost, which is typical of high Doppler
scenarios. The receivers have only partial state information: each user receiver can estimate
the MU-MIMO channel, but the modulation constellation of the co-scheduled user remains unknown.

Several receiver processing methods at the user equipment (UE) have been proposed for MU-MIMO
systems. One approach is to treat the co-scheduled user in MU-MIMO just as additional noise.
This type of processing results in sub-optimal receiver performance as depicted in [?].
Another approach is to employ linear processing using minimum mean-squared error (MMSE)
or interference rejection combining (IRC) as shown in [10], where only the knowledge of the
channel estimate of the co-scheduled interfering user is used in the detection of the desired
user. In [10], [?], the constellation size of the interfering user’s signal is presumed to be
16-QAM regardless of its actual size, and, without making any attempt to estimate it, a maximum
likelihood (ML) detector is used to separate the two users.

In this work, we consider optimal detection methods for MU-MIMO systems. We argue that
detection performance can be enhanced if the UE receiver treats the co-scheduled user’s signal
as a constrained unknown to be estimated rather than just as additional random noise. We
employ joint/conditional ML detection, where the constellation size of the co-scheduled user’s
signal needs to be estimated first before symbol detection and decoding are performed. We
address the co-scheduled user’s constellation size estimation problem as part of advanced
MU-MIMO detection techniques. We show that the optimal MU-MIMO detector can be efficiently
implemented with a slight modification of the ML MIMO detector.

The rest of the paper is organized as follows. The system model is described in Section II.
Linear receiver methods are discussed in Section III, while the proposed joint optimal
constellation size estimation and detection scheme is presented in Section V. In Section VI, an
efficient hardware architecture for the proposed MU-MIMO receiver is presented. Section VII
discusses simulation results, while Section VIII concludes the paper.

Notation: Unless otherwise stated, lower case and upper case bold letters denote column
vectors and matrices, respectively; $\mathbf{I}_{N_r}$ denotes the identity matrix of size $N_r$;
$(\cdot)^*$ denotes the complex conjugate transpose operation; $(\cdot)^T$ denotes the transpose
operation; and $E[Z]$ is the expectation of the random variable $Z$.


II. System Model 

Consider a generic OFDM-based MU-MIMO system where $K$ users are scheduled on the
same PRBs. Denote by $N$ the number of tones in each PRB, and assume that each UE is
equipped with $N_r$ antennas. The received frequency-domain complex signal
$\mathbf{y}_i \in \mathbb{C}^{N_r \times 1}$ at the UE of interest on the $i$th resource element
over which the $K$ users are scheduled is given by

\[
\mathbf{y}_i = \mathbf{H}_i \mathbf{x}_i + \mathbf{n}_i, \quad i = 1, 2, \ldots, N,
\]

where $\mathbf{H}_i \in \mathbb{C}^{N_r \times K}$ is the complex channel matrix, $\mathbf{x}_i$
denotes the transmitted $K \times 1$ QAM symbol vector for the $K$ users, and
$\mathbf{n}_i \in \mathbb{C}^{N_r \times 1}$ denotes thermal noise modeled as a zero-mean
complex Gaussian random vector with covariance matrix
$E[\mathbf{n}_i \mathbf{n}_i^*] = \mathbf{R} = \sigma^2 \mathbf{I}_{N_r}$.
We consider the practical example where $K = N_r = 2$, so that the received signal at the $i$th
resource element can be written as follows

\[
\mathbf{y}_i = \mathbf{h}_1^{(i)} x_1^{(i)} + \mathbf{h}_2^{(i)} x_2^{(i)} + \mathbf{n}_i,
\quad i = 1, \ldots, N,
\]

where $\mathbf{H}_i = [\mathbf{h}_1^{(i)} \;\; \mathbf{h}_2^{(i)}]$, and $\mathbf{h}_1^{(i)}$ and
$\mathbf{h}_2^{(i)}$ denote the cascade of the channel and the precoders of user 1 and user 2,
respectively.

Let user 1 denote the user of interest and user 2 the interfering (co-scheduled) user
whose constellation size is unknown to user 1. Denote $\mathcal{M}_S$ and $\mathcal{M}_I$ as the
constellations of user 1 and user 2, respectively, and $N$ as the number of resource elements
over which $\mathcal{M}_I$ is constant. The symbols $x_1^{(i)}$ and $x_2^{(i)}$ represent
$\log_2(|\mathcal{M}_S|)$ and $\log_2(|\mathcal{M}_I|)$ coded bits from a channel encoder
(e.g., turbo code), respectively. The $j$th bit of symbol $x_1^{(i)}$ is denoted by $b_{i,j}$.
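
As an illustration, the two-user signal model above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: square QAM constellations normalized to unit average energy, i.i.d. unit-variance complex Gaussian effective channels, and hypothetical helper names (`qam_constellation`, `cgauss`, `simulate_tone`) that do not appear in the paper.

```python
import math
import random


def qam_constellation(m):
    """Square m-QAM points (m = 4, 16, 64), scaled to unit average energy."""
    side = math.isqrt(m)
    levels = [2 * k - (side - 1) for k in range(side)]  # e.g. -3, -1, 1, 3
    points = [complex(re, im) for re in levels for im in levels]
    scale = math.sqrt(sum(abs(p) ** 2 for p in points) / len(points))
    return [p / scale for p in points]


def cgauss(rng, var=1.0):
    """Zero-mean circularly symmetric complex Gaussian sample of variance var."""
    s = math.sqrt(var / 2.0)
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))


def simulate_tone(rng, const_s, const_i, sigma2):
    """One resource element with N_r = 2: y = h1*x1 + h2*x2 + n."""
    h1 = [cgauss(rng), cgauss(rng)]  # effective channel of user 1 (assumed i.i.d.)
    h2 = [cgauss(rng), cgauss(rng)]  # effective channel of user 2 (assumed i.i.d.)
    x1 = rng.choice(const_s)
    x2 = rng.choice(const_i)
    y = [h1[r] * x1 + h2[r] * x2 + cgauss(rng, sigma2) for r in range(2)]
    return y, h1, h2, x1, x2
```

For example, `simulate_tone(random.Random(0), qam_constellation(4), qam_constellation(16), 0.01)` draws one tone with a 4-QAM desired user and a 16-QAM co-scheduled user.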

III. Linear Receiver Methods 


Two linear detection methods for MU-MIMO based on IRC are considered. They differ in 
terms of their complexity and performance. 


A. Covariance-based Linear Receiver 


This receiver structure estimates the interference covariance matrix using knowledge of only
the desired user’s channel, and thus uses the least knowledge about the co-scheduled user.
Denote $x_1^{(p)}$ and $\mathbf{y}_p$ as the known pilot and received vector, respectively, on the
$p$th tone, where $p = 1, 2, \ldots, N_p$ and $N_p$ is the number of pilot tones amongst the set
of $N$ tones over which the two users are co-scheduled. Then, the interference sample covariance
matrix is estimated as

\[
\hat{\mathbf{R}}_{uu} = \frac{1}{N_p} \sum_{p=1}^{N_p} \mathbf{u}_p \mathbf{u}_p^*,
\]

where $\mathbf{u}_p = \mathbf{y}_p - \mathbf{h}_1^{(p)} x_1^{(p)}$. Note that there is a single
interference covariance matrix generated per PRB, i.e. $\hat{\mathbf{R}}_{uu}$ is a sample
covariance obtained by averaging over the pilot tones in the PRB, and thus is independent of the
tone index $i$. The weight vector for the linear receiver is given by

\[
\mathbf{w}_{1,\mathrm{cov}}^{(i)} = \hat{\mathbf{R}}_{uu}^{-1} \mathbf{h}_1^{(i)}. \tag{1}
\]

The desired user receiver applies this weight vector (1) to the received vector to obtain
$z_{1,\mathrm{cov}}^{(i)} = \mathbf{w}_{1,\mathrm{cov}}^{(i)*} \mathbf{y}_i$, which is a scalar
used to generate the soft decision metric for decoding.
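
The covariance-based receiver can be sketched as follows for $N_r = 2$. This is only an illustrative sketch: it assumes the weight vector in (1) has the form of the inverse sample covariance applied to the desired channel, it requires at least two pilot tones with linearly independent residuals so that the sample covariance is invertible, and the function names are hypothetical.

```python
def inv2(a):
    """Inverse of a 2x2 complex matrix given as nested lists."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]


def sample_covariance(pilot_obs):
    """pilot_obs: list of (y_p, h1_p, x1_p). Returns (1/N_p) * sum_p u_p u_p^*."""
    acc = [[0j, 0j], [0j, 0j]]
    for y, h1, x1 in pilot_obs:
        # Residual after removing the known desired-user pilot contribution.
        u = [y[r] - h1[r] * x1 for r in range(2)]
        for r in range(2):
            for c in range(2):
                acc[r][c] += u[r] * u[c].conjugate()
    n = len(pilot_obs)
    return [[acc[r][c] / n for c in range(2)] for r in range(2)]


def cov_weight(r_uu, h1):
    """Weight vector w = R_uu^{-1} h1 (assumed form of equation (1))."""
    r_inv = inv2(r_uu)
    return [r_inv[r][0] * h1[0] + r_inv[r][1] * h1[1] for r in range(2)]


def combine(w, y):
    """Scalar combiner output z = w^* y."""
    return sum(w[r].conjugate() * y[r] for r in range(2))
```

Because a single covariance estimate is formed per PRB, `sample_covariance` would be called once per PRB, while `cov_weight` and `combine` are applied per tone.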
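B. Linear IRC Receiver


An improvement over the covariance-based linear receiver is obtained when the UE receiver
uses estimates of both the desired channel and the co-scheduled user’s channel. We arrive
at an IRC weight vector that is given by [10]

\[
\mathbf{w}_{1,\mathrm{IRC}}^{(i)} =
\left( \mathbf{h}_2^{(i)} \mathbf{h}_2^{(i)*} + \mathbf{R} \right)^{-1} \mathbf{h}_1^{(i)}.
\]

The desired user receiver processing is obtained by

\[
z_{1,\mathrm{IRC}}^{(i)} = \mathbf{w}_{1,\mathrm{IRC}}^{(i)*} \mathbf{y}_i,
\]

which can be approximated by a complex Gaussian random variable, i.e.
$p\big(z_{1,\mathrm{IRC}}^{(i)} \mid \mathbf{h}_1^{(i)}, \mathbf{h}_2^{(i)}, x_1^{(i)}\big) \sim
\mathcal{CN}(\mu_z, \sigma_z^2)$, where the mean and variance are given by

\[
\mu_z = \mathbf{w}_{1,\mathrm{IRC}}^{(i)*} \mathbf{h}_1^{(i)} x_1^{(i)},
\qquad
\sigma_z^2 = \mathbf{w}_{1,\mathrm{IRC}}^{(i)*}
\left( \mathbf{h}_2^{(i)} \mathbf{h}_2^{(i)*} + \mathbf{R} \right)
\mathbf{w}_{1,\mathrm{IRC}}^{(i)}.
\]

The LLR of the $j$th bit of the desired user QAM symbol using the max-log-MAP approximation
is given by

\[
\Lambda(b_{i,j}) = \frac{1}{\sigma_z^2} \left[
\min_{x_1^{(i)} \in \mathcal{M}_{S,j}^{-}}
\left| z_{1,\mathrm{IRC}}^{(i)} - \mathbf{w}_{1,\mathrm{IRC}}^{(i)*} \mathbf{h}_1^{(i)} x_1^{(i)} \right|^2
-
\min_{x_1^{(i)} \in \mathcal{M}_{S,j}^{+}}
\left| z_{1,\mathrm{IRC}}^{(i)} - \mathbf{w}_{1,\mathrm{IRC}}^{(i)*} \mathbf{h}_1^{(i)} x_1^{(i)} \right|^2
\right],
\]

where $\mathcal{M}_{S,j}^{+} = \{ x_1^{(i)} \in \mathcal{M}_S : b_{i,j} = +1 \}$ and
$\mathcal{M}_{S,j}^{-} = \{ x_1^{(i)} \in \mathcal{M}_S : b_{i,j} = -1 \}$.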
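
A minimal sketch of the IRC weight computation and the max-log-MAP LLR for $N_r = 2$ is given below. Several points are assumptions made for illustration only: the interference-plus-noise covariance is taken as $\mathbf{h}_2\mathbf{h}_2^* + \sigma^2\mathbf{I}$ (unit-energy interferer), the LLR sign convention is that a positive value favors $b_{i,j} = +1$, and the bit labeling is a hypothetical natural-binary labeling of the constellation index rather than the Gray mapping used by a real standard.

```python
import math


def qam_constellation(m):
    """Square m-QAM points scaled to unit average energy."""
    side = math.isqrt(m)
    levels = [2 * k - (side - 1) for k in range(side)]
    points = [complex(re, im) for re in levels for im in levels]
    scale = math.sqrt(sum(abs(p) ** 2 for p in points) / len(points))
    return [p / scale for p in points]


def inv2(a):
    """Inverse of a 2x2 complex matrix given as nested lists."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]


def irc_weight(h1, h2, sigma2):
    """Return (w, c) with c = h2 h2^* + sigma2*I and w = c^{-1} h1."""
    c = [[h2[r] * h2[k].conjugate() + (sigma2 if r == k else 0.0)
          for k in range(2)] for r in range(2)]
    ci = inv2(c)
    w = [ci[r][0] * h1[0] + ci[r][1] * h1[1] for r in range(2)]
    return w, c


def irc_llrs(y, h1, h2, sigma2, const_s):
    """Max-log LLRs of the desired user's bits after IRC combining."""
    w, c = irc_weight(h1, h2, sigma2)
    z = sum(w[r].conjugate() * y[r] for r in range(2))       # z = w^* y
    gain = sum(w[r].conjugate() * h1[r] for r in range(2))   # w^* h1
    var = sum(w[r].conjugate() * c[r][k] * w[k]
              for r in range(2) for k in range(2)).real      # w^* c w
    nbits = len(const_s).bit_length() - 1
    llrs = []
    for j in range(nbits):
        best = {+1: math.inf, -1: math.inf}
        for idx, x in enumerate(const_s):
            bit = 1 if (idx >> j) & 1 else -1  # hypothetical bit labeling
            best[bit] = min(best[bit], abs(z - gain * x) ** 2)
        llrs.append((best[-1] - best[+1]) / var)
    return llrs
```

Note that with this choice of weight vector the effective gain $\mathbf{w}^*\mathbf{h}_1$ and the variance $\sigma_z^2$ coincide, since both equal $\mathbf{h}_1^* (\mathbf{h}_2\mathbf{h}_2^* + \mathbf{R})^{-1} \mathbf{h}_1$.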

IV. Reduced Complexity Constellation Size Estimation 

We estimate the constellation size of the co-scheduled user by nulling out the unknown signal
of the user of interest using a linear filter. The equivalent system after applying this linear
filter to the received vector is given by the following single-input single-output relation

\[
\tilde{y}_i = \mathbf{g}_i^* \mathbf{y}_i
= \mathbf{g}_i^* \mathbf{h}_2^{(i)} x_2^{(i)} + \mathbf{g}_i^* \mathbf{n}_i
= a_i x_2^{(i)} + \tilde{n}_i,
\]

where $\mathbf{g}_i$ is chosen to be orthogonal to $\mathbf{h}_1^{(i)}$, i.e.,
$\mathbf{g}_i^* \mathbf{h}_1^{(i)} = 0$. The nulling filter is given by
$\mathbf{g}_i = \mathbf{G}\mathbf{c}$, where $\mathbf{c}$ is a $2 \times 1$ combining vector and
$\mathbf{G}$ is the left null-space projection matrix given by [13]

\[
\mathbf{G} = \mathbf{I}_2 - \frac{\mathbf{h}_1^{(i)} \mathbf{h}_1^{(i)*}}{\| \mathbf{h}_1^{(i)} \|^2}.
\]

Since $\mathbf{G}$ has rank 1, the combining vector can simply be $\mathbf{c} = [1 \;\; 0]^T$.

Next, the ML estimate of the constellation is obtained based on the filtered received signal
$\tilde{y}_i$ and can be approximated as

\[
\hat{\mathcal{M}}_I \approx \arg\min_{\mathcal{M}_I \in \mathbb{M}} \left(
N \log(|\mathcal{M}_I|) + \sum_{i=1}^{N} \min_{x_2^{(i)} \in \mathcal{M}_I}
\frac{\left| \tilde{y}_i - a_i x_2^{(i)} \right|^2}{\tilde{\sigma}_i^2} \right), \tag{2}
\]

where $\log(\cdot)$ is the natural logarithmic function,
$\tilde{\sigma}_i^2 = \sigma^2 \|\mathbf{g}_i\|^2$ is the variance of $\tilde{n}_i$, and
$\mathbb{M} = \{\emptyset, \text{4-QAM}, \text{16-QAM}, \text{64-QAM}\}$ denotes the set of
allowable constellations that the interferer can assume, including the case when the
co-scheduled user is not present, i.e. $\emptyset$. The derivation of (2) is omitted here as it
follows closely the development of the next section. Once the co-scheduled user constellation,
$\hat{\mathcal{M}}_I$, is estimated, the LLRs are generated as shown in the next section.
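
The nulling-based classifier can be sketched as follows. This is a minimal sketch, not the authors' implementation: it assumes the per-tone distance in (2) is normalized by the filtered noise variance $\sigma^2\|\mathbf{g}_i\|^2$, it models the "user not present" hypothesis as a constellation containing the single point 0 (so that its log-size penalty is zero), and the function names are hypothetical.

```python
import math


def qam_constellation(m):
    """Square m-QAM points scaled to unit average energy."""
    side = math.isqrt(m)
    levels = [2 * k - (side - 1) for k in range(side)]
    points = [complex(re, im) for re in levels for im in levels]
    scale = math.sqrt(sum(abs(p) ** 2 for p in points) / len(points))
    return [p / scale for p in points]


def nulling_filter(h1):
    """g = G c with G = I - h1 h1^*/||h1||^2 and c = [1, 0]^T (first column of G)."""
    n2 = abs(h1[0]) ** 2 + abs(h1[1]) ** 2
    return [1.0 - h1[0] * h1[0].conjugate() / n2,
            -h1[1] * h1[0].conjugate() / n2]


def classify_nulling(tones, sigma2, hypotheses):
    """tones: list of (y, h1, h2); hypotheses: dict name -> list of points.

    Returns (best_name, metrics), where each metric is
    N*log|M_I| + sum_i min_x2 |y~_i - a_i x2|^2 / (sigma2 * ||g_i||^2).
    """
    metrics = {}
    for name, const in hypotheses.items():
        total = len(tones) * math.log(len(const))
        for y, h1, h2 in tones:
            g = nulling_filter(h1)
            y_t = g[0].conjugate() * y[0] + g[1].conjugate() * y[1]
            a = g[0].conjugate() * h2[0] + g[1].conjugate() * h2[1]
            n_var = sigma2 * (abs(g[0]) ** 2 + abs(g[1]) ** 2)
            total += min(abs(y_t - a * x) ** 2 for x in const) / n_var
        metrics[name] = total
    return min(metrics, key=metrics.get), metrics
```

The hypothesis set would typically be built as `{"none": [0j], "4-QAM": qam_constellation(4), "16-QAM": qam_constellation(16), "64-QAM": qam_constellation(64)}`. The choice $\mathbf{c} = [1 \; 0]^T$ degenerates when the second entry of $\mathbf{h}_1^{(i)}$ is exactly zero, which this sketch does not guard against.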


V. Joint Constellation Size Estimation and Detection 
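The key idea is to better exploit the diversity offered by the two receive antennas. We develop
an optimal receiver for user 1 based on a step of estimating the modulation constellation of user
2 using the received signal and knowledge of the channel. We derive the ML estimate of the
constellation size based on the received signal $\mathbf{y}_i$ itself instead of the
reduced-dimension version $\tilde{y}_i$ obtained in Section IV. The optimal ML estimator will
inherently average out the unknown desired signal using knowledge of its constellation size, and
yields better results than just nulling out the unknown signal. The new ML estimate of the
constellation of the interfering user based on $\{\mathbf{y}_i\}_{i=1}^{N}$ is given by

\[
\hat{\mathcal{M}}_I = \arg\max_{\mathcal{M}_I \in \mathbb{M}}
p\left( \mathbf{y}_1, \ldots, \mathbf{y}_N \mid \mathbf{H}_1, \ldots, \mathbf{H}_N,
\mathcal{M}_S, \mathcal{M}_I \right).
\]

Since $x_1^{(i)}$, $x_2^{(i)}$ and $\mathbf{n}_i$ are independent for $i = 1, \ldots, N$, the ML
estimate of the interferer’s constellation can then be written as

\[
\begin{aligned}
\hat{\mathcal{M}}_I
&= \arg\max_{\mathcal{M}_I \in \mathbb{M}} \prod_{i=1}^{N}
p\left( \mathbf{y}_i \mid \mathbf{H}_i, \mathcal{M}_S, \mathcal{M}_I \right) \\
&= \arg\max_{\mathcal{M}_I \in \mathbb{M}} \prod_{i=1}^{N}
\sum_{x_1^{(i)} \in \mathcal{M}_S} \sum_{x_2^{(i)} \in \mathcal{M}_I}
p\left( \mathbf{y}_i \mid \mathbf{H}_i, x_1^{(i)}, x_2^{(i)} \right)
p\left( x_1^{(i)} \mid \mathcal{M}_S \right) p\left( x_2^{(i)} \mid \mathcal{M}_I \right) \\
&= \arg\max_{\mathcal{M}_I \in \mathbb{M}} \frac{1}{|\mathcal{M}_I|^{N}} \prod_{i=1}^{N}
\sum_{x_1^{(i)} \in \mathcal{M}_S} \sum_{x_2^{(i)} \in \mathcal{M}_I}
p\left( \mathbf{y}_i \mid \mathbf{H}_i, x_1^{(i)}, x_2^{(i)} \right),
\end{aligned}
\]

where $|\mathcal{M}_I|$ denotes the size of the interfering user constellation. The last equality
follows from the assumptions that

\[
p\left( x_1^{(i)} \mid \mathcal{M}_S \right) = \frac{1}{|\mathcal{M}_S|},
\qquad
p\left( x_2^{(i)} \mid \mathcal{M}_I \right) = \frac{1}{|\mathcal{M}_I|}.
\]

Let $\mathbf{x}_i = [x_1^{(i)} \;\; x_2^{(i)}]^T$ and
$d(\mathbf{x}_i) = (\mathbf{y}_i - \mathbf{H}_i \mathbf{x}_i)^* \mathbf{R}^{-1}
(\mathbf{y}_i - \mathbf{H}_i \mathbf{x}_i)$. We can then write $\hat{\mathcal{M}}_I$ as

\[
\hat{\mathcal{M}}_I = \arg\max_{\mathcal{M}_I \in \mathbb{M}} \frac{1}{|\mathcal{M}_I|^{N}}
\prod_{i=1}^{N} \sum_{x_1^{(i)} \in \mathcal{M}_S,\; x_2^{(i)} \in \mathcal{M}_I}
\exp\left( -d(\mathbf{x}_i) \right).
\]

Using the max-log-MAP approximation, we have that

\[
\hat{\mathcal{M}}_I \approx \arg\min_{\mathcal{M}_I \in \mathbb{M}} \left(
N \log(|\mathcal{M}_I|) + \sum_{i=1}^{N}
\min_{x_1^{(i)} \in \mathcal{M}_S,\; x_2^{(i)} \in \mathcal{M}_I} d(\mathbf{x}_i) \right). \tag{3}
\]

The simplifications to (3) will become evident in the receiver implementation of Section VI. As
seen from (3), the modulation classification metric is a sum over the set of $N$ tones, over which
the co-scheduled user is stationary, of the Euclidean distance between the received vector and
the symbol 2-tuple $\mathbf{x}_i$.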

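Once the co-scheduled user’s constellation, $\hat{\mathcal{M}}_I$, is estimated, the LLR of the
$j$th bit of the desired user QAM symbol $x_1^{(i)}$ using the max-log-MAP approximation is given
by [14]

\[
\Lambda(b_{i,j}) =
\min_{x_1^{(i)} \in \mathcal{M}_{S,j}^{-},\; x_2^{(i)} \in \hat{\mathcal{M}}_I} d(\mathbf{x}_i)
-
\min_{x_1^{(i)} \in \mathcal{M}_{S,j}^{+},\; x_2^{(i)} \in \hat{\mathcal{M}}_I} d(\mathbf{x}_i),
\tag{4}
\]

where $\mathcal{M}_{S,j}^{+}$ and $\mathcal{M}_{S,j}^{-}$, the subsets of $\mathcal{M}_S$ whose
$j$th bit is $+1$ and $-1$, respectively, are defined in Section III-B. As seen from (4),
computing the LLRs involves the same Euclidean distance computations as those needed for the
co-scheduled user’s constellation estimation in (3). This fact will be exploited in the hardware
implementation of the joint constellation classification and data detection.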
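
The joint classification metric (3) and the LLR (4) can be sketched directly from the distance $d(\mathbf{x}_i)$ with $\mathbf{R} = \sigma^2\mathbf{I}$. This is an exhaustive-search sketch under stated assumptions, not an optimized detector: the bit labeling is a hypothetical natural-binary labeling of the constellation index, a positive LLR favors $b_{i,j} = +1$, the absent-user hypothesis would be modeled as the single point 0, and the function names are illustrative.

```python
import math


def qam_constellation(m):
    """Square m-QAM points scaled to unit average energy."""
    side = math.isqrt(m)
    levels = [2 * k - (side - 1) for k in range(side)]
    points = [complex(re, im) for re in levels for im in levels]
    scale = math.sqrt(sum(abs(p) ** 2 for p in points) / len(points))
    return [p / scale for p in points]


def distance(y, h1, h2, x1, x2, sigma2):
    """d(x) = (y - Hx)^* R^{-1} (y - Hx) with R = sigma2 * I and N_r = 2."""
    return sum(abs(y[r] - h1[r] * x1 - h2[r] * x2) ** 2 for r in range(2)) / sigma2


def classify_joint(tones, sigma2, const_s, hypotheses):
    """Metric (3): N*log|M_I| + sum_i min over (x1, x2) of d(x_i)."""
    metrics = {}
    for name, const_i in hypotheses.items():
        total = len(tones) * math.log(len(const_i))
        for y, h1, h2 in tones:
            total += min(distance(y, h1, h2, x1, x2, sigma2)
                         for x1 in const_s for x2 in const_i)
        metrics[name] = total
    return min(metrics, key=metrics.get), metrics


def joint_llrs(y, h1, h2, sigma2, const_s, const_i):
    """LLR (4) for each bit of x1, given the estimated interferer constellation."""
    # For each candidate x1, keep the smallest distance over all x2.
    per_x1 = [min(distance(y, h1, h2, x1, x2, sigma2) for x2 in const_i)
              for x1 in const_s]
    nbits = len(const_s).bit_length() - 1
    return [min(d for k, d in enumerate(per_x1) if not (k >> j) & 1)
            - min(d for k, d in enumerate(per_x1) if (k >> j) & 1)
            for j in range(nbits)]
```

Both functions evaluate the same quantity, the per-candidate minimum of $d(\mathbf{x}_i)$, which is the commonality between classification and detection noted above.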


VI. MU-MIMO Receiver Hardware Implementation 
Figure 1 shows an optimized architecture for a 2x2 MU-MIMO detector, constructed using an
ML MIMO detector as its core. The MIMO detector detects the received signal $\mathbf{y}_i$
assuming all 4 possible choices of the interferer’s constellation. It generates 4 corresponding
lists of minimum distance metrics $d(\mathbf{x}_i)$ and their associated symbol vectors
$\mathbf{x}_i$ for all the $|\mathbb{M}|$ possible hypotheses of the interfering user
constellation, with $x_1^{(i)} \in \mathcal{M}_S$. These distances and symbols are stored in four
buffers, each of size $|\mathcal{M}_S|$, as shown in the figure.

For each tone to be detected, the minimum distance from each list is passed to an adder
that accumulates the minimum distances over a span of $N$ tones, during which the interferer
modulation is assumed to be static. The resulting 4 minimum accumulated distances, one for each
interferer hypothesis, are stored in a buffer. The minimum from this buffer is used to identify
the interferer’s constellation, and the corresponding stored distances in the buffers are selected
and forwarded for LLR processing according to (4).
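
The data flow described above (per-hypothesis distance lists, accumulation over $N$ tones, selection, and reuse of the stored distances for LLR generation) can be mimicked in software as follows. This is only a behavioral sketch under stated assumptions, not a model of the actual hardware: distances are found by exhaustive search, the log-size term of (3) is used to initialize each accumulator, the bit labeling is a hypothetical natural-binary labeling, and all names are illustrative.

```python
import math


def qam_constellation(m):
    """Square m-QAM points scaled to unit average energy."""
    side = math.isqrt(m)
    levels = [2 * k - (side - 1) for k in range(side)]
    points = [complex(re, im) for re in levels for im in levels]
    scale = math.sqrt(sum(abs(p) ** 2 for p in points) / len(points))
    return [p / scale for p in points]


def distance_list(y, h1, h2, sigma2, const_s, const_i):
    """List of size |M_S|: for each x1, the smallest d(x) over x2 in const_i."""
    out = []
    for x1 in const_s:
        best = math.inf
        for x2 in const_i:
            d = sum(abs(y[r] - h1[r] * x1 - h2[r] * x2) ** 2 for r in range(2))
            best = min(best, d / sigma2)
        out.append(best)
    return out


def detect_block(tones, sigma2, const_s, hypotheses):
    """Classify the interferer over the given tones, then emit LLRs from the buffers."""
    buffers = {name: [] for name in hypotheses}            # stored distance lists
    acc = {name: len(tones) * math.log(len(c))             # accumulators, metric (3)
           for name, c in hypotheses.items()}
    for y, h1, h2 in tones:
        for name, const_i in hypotheses.items():
            lst = distance_list(y, h1, h2, sigma2, const_s, const_i)
            buffers[name].append(lst)
            acc[name] += min(lst)
    winner = min(acc, key=acc.get)
    nbits = len(const_s).bit_length() - 1
    llrs = []
    for lst in buffers[winner]:                            # no distances recomputed here
        row = []
        for j in range(nbits):
            d_plus = min(d for k, d in enumerate(lst) if (k >> j) & 1)
            d_minus = min(d for k, d in enumerate(lst) if not (k >> j) & 1)
            row.append(d_minus - d_plus)
        llrs.append(row)
    return winner, llrs
```

The point of the sketch is that the LLR stage only reads the buffer of the winning hypothesis, so classification adds accumulators and a comparison but no second pass of distance computations for those tones.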

Note that since the interferer’s modulation constellation remains static over $N$ tones for a
duration of 1 subframe in LTE (14 OFDM symbols), the particular choice of $N = 12$ results
in substantial savings in computations. The detector only needs to run in the above mode to
identify the interferer’s constellation for one OFDM symbol in the subframe. It can then switch
back to normal ML detection mode (without modulation classification) to generate the LLRs
for the remaining 13 OFDM symbols for the user of interest $x_1^{(i)}$.

Taking the LTE scenario for hardware complexity analysis, the total number of possible tones
in 1 PRB in a subframe is 12 x 14 = 168. Of these tones, 28 are reserved for pilots (for
cell-specific reference signals and for UE-specific pilots to support the MU-MIMO transmission
mode), and 140 for data. In the hardware architecture of Fig. 1, the total number of distance
computations needed to generate the LLRs from these 140 data tones is 140 x $|\mathcal{M}_S|$.
This corresponds to an increase of only 22.8% compared to the distances computed by an ML
detector with perfect knowledge of the interferer.


VII. Simulation Results


The performance of the null projection method of Section IV is compared to the proposed
method of Section V. We simulate the probability of correctly detecting the constellation size
of the interfering user for different desired and interfering constellation sizes. The received
desired user power is assumed to be equal to the received interfering user’s power, i.e. the
signal-to-interference ratio (SIR) is 0 dB. The channel is generated as zero-mean circularly
symmetric complex Gaussian with unit variance, independent and identically distributed (i.i.d.)
from tone to tone. Figures 2 and 3 show the results for $\mathcal{M}_S$ = 4-QAM and 64-QAM,
respectively, with $\mathcal{M}_I$ = 4-, 16-, and 64-QAM using $N = 24$ resource elements. The
figures show that the ML classification method has a 5 dB gain over the nulling approach for
$\mathcal{M}_S$ = 4-QAM and a 2 dB gain in the case of $\mathcal{M}_S$ = 64-QAM. Therefore, the
gain of the ML classification method is largest for small constellation sizes of the desired
signal, i.e. the largest gain is attained when the receiver complexity is minimal.

Fig. 1. Architecture for a 2 x 2 MU-MIMO detector

Next, the performance of the ML modulation constellation classification scheme for different
values of $N$ is investigated. In Fig. 4 we compare the constellation size detection performance
with $N = 1$ (i.e. a single resource element), 12 and 24 for $|\mathcal{M}_S| = 4$ and different
interference constellation sizes. Figure 4 shows that $N = 12$ resource elements are adequate for
robust detection of the interferer’s constellation size. This is exploited in the receiver
architecture of Section VI to reduce computational complexity and memory requirements.

Remark: It could be argued that increasing $N$ should always improve the performance due to
better noise averaging. However, from (3), we find that, in addition to noise, there are residual
errors in the minimization of the quadratic term in (3). This residual error occurs when the
$\mathbf{x}_i$ that minimizes the quadratic term differs from the originally transmitted
$\mathbf{x}_i$. The main reason for these errors is that the noise exceeds the minimum distance
of the lattice constructed by the set of all symbol vectors $\mathbf{x}_i$. Interestingly, we
notice that the detection performance with $N = 1$ is better than that with $N = 12$ and 24 for
very low SNR. This can be explained by the fact that adding more samples (increasing $N$) would
add more noise than signal when the SNR is very low. Hence, it would be better if fewer samples
are used for the detection.

The BLER performance of the various receivers is compared when both users use 64-QAM,
with the turbo code of [15] and encoding rate 1/2 using a block size of 6144 bits. Figure 5 shows
the results for the case of the pedestrian-B (Ped-B) multi-path channel model with no antenna
correlations [?]. Figure 5 shows that the joint ML classification and detection method is about
0.1 dB away from having perfect knowledge of the co-scheduled user constellation. The ML
method has about 1.5 dB and 1 dB gain over the nulling method for $N = 12$ and $N = 24$,
respectively. The MMSE method, which does not require the estimation of the co-scheduled
user constellation but only its channel, has about 1 dB degradation relative to the ML method.
We also simulated the impact of antenna correlation on the BLER performance of these receivers.
The pedestrian-A (Ped-A) multi-path channel model with high antenna correlations, where both
transmit and receive correlation coefficients are 0.9, is considered. The effective channel
matrix is given by $\mathbf{H} = \mathbf{R}_r^{1/2} \mathbf{H}_c \mathbf{R}_t^{1/2}$, where
$\mathbf{H}_c$ is the channel whose entries are uncorrelated and generated according to the Ped-A
model, and $\mathbf{R}_t$ and $\mathbf{R}_r$ are the 2x2 transmit and receive antenna correlation
matrices, respectively, which have 1 on the diagonal entries and 0.9 on the off-diagonal entries.
As seen in Fig. 6, the nulling and MMSE methods have significant performance degradation as
compared to the joint ML method.
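
The antenna correlation model can be illustrated with a short sketch. It assumes the common Kronecker form $\mathbf{H} = \mathbf{R}_r^{1/2}\mathbf{H}_c\mathbf{R}_t^{1/2}$ with real correlation coefficients, uses the closed-form square root of a 2x2 matrix with unit diagonal and off-diagonal entry $\rho$, and draws $\mathbf{H}_c$ as i.i.d. complex Gaussian for a single tone rather than from the Ped-A multi-path profile. The function names are hypothetical.

```python
import math
import random


def corr_sqrt(rho):
    """Square root of [[1, rho], [rho, 1]] for real 0 <= rho <= 1."""
    a = 0.5 * (math.sqrt(1 + rho) + math.sqrt(1 - rho))
    b = 0.5 * (math.sqrt(1 + rho) - math.sqrt(1 - rho))
    return [[a, b], [b, a]]


def matmul2(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[r][k] * q[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]


def correlated_channel(h_c, rho_t, rho_r):
    """H = R_r^{1/2} H_c R_t^{1/2} (assumed Kronecker correlation model)."""
    return matmul2(matmul2(corr_sqrt(rho_r), h_c), corr_sqrt(rho_t))


def iid_channel(rng):
    """2x2 matrix of i.i.d. unit-variance complex Gaussian entries (stand-in for H_c)."""
    s = math.sqrt(0.5)
    return [[complex(rng.gauss(0, s), rng.gauss(0, s)) for _ in range(2)]
            for _ in range(2)]
```

For the high-correlation case considered here, a channel realization would be obtained as `correlated_channel(iid_channel(random.Random(0)), 0.9, 0.9)`.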


VIII. Conclusions 

A receiver for MU-MIMO transmission in the context of OFDM, where two users share the
same time and frequency resources, has been developed. This receiver is based on joint ML
modulation classification of the co-scheduled user and data detection, and it outperforms linear
receiver approaches. It has been shown that, using the max-log-MAP approximation, the decision
metric for the modulation classification is an accumulation over a set of tones of the Euclidean
distance metrics that are also used in the detector for LLR computations. An efficient hardware
architecture emerges that exploits this commonality between the modulation classification and
data detection steps and results in a sharing of hardware resources.

Fig. 2. Probability of correct interferer modulation constellation detection versus SNR (dB). Solid lines are for the nulling
approach, dashed lines are for the ML approach. Desired user constellation is fixed to 4-QAM and the co-scheduled user
constellation is 4-, 16-, or 64-QAM. The channel is i.i.d. block fading.

Fig. 3. Probability of correct interferer modulation constellation detection versus SNR (dB). Solid lines are for the nulling
approach, dashed lines are for the ML approach. Desired user constellation is fixed to 64-QAM and the co-scheduled user
constellation is 4-, 16-, or 64-QAM. The channel is i.i.d. block fading.

Fig. 4. Probability of correct interferer modulation constellation detection versus SNR (dB) for various values of N = 1, 12, 24
tones and desired user is 64-QAM. Channel is i.i.d. block fading.

Fig. 5. BLER versus per-tone per-antenna SNR (dB). PED-B channel, no correlation, 64-QAM for both users and code rate 1/2.

Fig. 6. BLER versus per-tone per-antenna SNR (dB). EPA channel, high correlation (0.9), 64-QAM for both users and code rate 1/2.